Verb-particle constructions in a computational grammar of English

نویسندگان

  • Aline Villavicencio
  • Ann Copestake
  • ALINE VILLAVICENCIO
چکیده

In this paper we investigate the phenomenon of verb-particle constructions, discussing their characteristics and the challenges that they present for a computational grammar. We concentrate our discussion on the treatment adopted in a wide-coverage HPSG grammar: the LinGO ERG. Given the constantly growing number of verb-particle combinations, possible ways of extending this treatment are investigated, taking into account the regular patterns found in some productive combinations of verbs and particles. We analyse possible ways of identifying regular patterns using different resources. One possible way to try to capture these is by means of lexical rules, and we discuss the dif£culties encountered when adopting such an approach. We also investigate how to restrict the productivity of lexical rules to deal with subregularities and exceptions to the patterns found. 18.1 Verb-Particle constructions in a nutshell In this paper we investigate verb-particle constructions in English and discuss some of the challenges that they pose for a broad-coverage computational grammar. By verb-particle constructions, we mean both idiosyncratic or semiidiosyncratic combinations, such as make up, in (1), where the meaning of the combination cannot be straightforwardly inferred from the meaning of the verb and the particle, and also more regular combinations, such as tear up, in (2). (1) He knew what he wanted and quickly made up his mind. (2) In a rage she tore up the letter Jack gave her. The Proceedings of the 9th International Conference on HPSG. Jong-Bok Kim and Stephen Wechsler (eds.). Copyright c © 2003, Stanford University.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of Verb-Particle Constructions in English

We propose different syntax-based methods for automatically identifying verb-particle constructions in English. The methods are based on the Deterministic Finitestate Automaton (DFA), Hidden Markov Model(HMM), and Synchronous ContextFree Grammar (SCFG). Our experiments show that the methods could result in F-score 83.3% over our manually annotated test-set consisting of Wikipedia articles and B...

متن کامل

Baldwin, Timothy (2005) The Deep Lexical Acquisition of English Verb-particle Constructions, Computer Speech and Language, Special Issue on Multiword Expressions, Volume 19, Issue 4, pp. 398-414

This paper proposes a range of techniques for extracting English verb–particle constructions from raw text corpora, complete with valence information. We propose four basic methods, based on the output of a POS tagger, chunker, chunk grammar and dependency parser, respectively. We then present a combined classifier which we show to consolidate the strengths of the component methods.

متن کامل

Complex Predicates are Multi-Word Expressions

Practitioners of English Natural Language Processing often feel fortunate because their tokens are clearly marked by spaces on either side. However, the spaces can be quite deceptive, since they ignore the boundaries of multi-word expressions, such as noun-noun compounds, verb particle constructions, light verb constructions and constructions from Construction Grammar, e.g., caused-motion const...

متن کامل

The two be's of English

This  qualitative  study  investigates  the  uses  of  be  in  Contemporary  English.  Based  on  this  study, one  easy  claim  and  one  more  difficult  claim  are  proposed.  The  easy  claim  is  that  the  traditional distinction between be as a lexical verb and be as an auxiliary is faulty. In particular, 'copular-be', traditionally considered to be a lexical verb, is in fact a prototypi...

متن کامل

Integrating Verb-Particle Constructions into CCG Parsing

Despite their prevalence in the English language, multiword expressions like verb-particle constructions (VPCs) are often poorly handled by NLP systems. This problem is partly due to inadequacies in existing corpora; the primary corpus for CCG-oriented work, CCGbank, does not account for VPCs at all, and is inconsistent in its handling of them. In this paper, we apply some corrective transforma...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002